Stable model semantics
The concept of a stable model, or answer set, is used to define a declarative semantics for logic programs with negation as failure. This is one of several standard approaches to the meaning of negation in logic programming, along with program completion and the well-founded semantics. The stable model semantics is the basis of answer set programming.
Motivation
[edit]Research on the declarative semantics of negation in logic programming was motivated by the fact that the behavior of SLDNF resolution—the generalization of SLD resolution used by Prolog in the presence of negation in the bodies of rules—does not fully match the truth tables familiar from classical propositional logic. Consider, for instance, the program
Given this program, the query p will succeed, because the program includes p as a fact; the query q will fail, because it does not occur in the head of any of the rules. The query r will fail also, because the only rule with r in the head contains the subgoal q in its body; as we have seen, that subgoal fails. Finally, the query s succeeds, because each of the subgoals p, succeeds. (The latter succeeds because the corresponding positive goal q fails.) To sum up, the behavior of SLDNF resolution on the given program can be represented by the following truth assignment:
p q r s T F F T.
On the other hand, the rules of the given program can be viewed as propositional formulas if we identify the comma with conjunction , the symbol with negation , and agree to treat as the implication written backwards. For instance, the last rule of the given program is, from this point of view, alternative notation for the propositional formula
If we calculate the truth values of the rules of the program for the truth assignment shown above then we will see that each rule gets the value T. In other words, that assignment is a model of the program. But this program has also other models, for instance
p q r s T T T F.
Thus one of the models of the given program is special in the sense that it correctly represents the behavior of SLDNF resolution. What are the mathematical properties of that model that make it special? An answer to this question is provided by the definition of a stable model.
Relation to nonmonotonic logic
[edit]The meaning of negation in logic programs is closely related to two theories of nonmonotonic reasoning—autoepistemic logic and default logic. The discovery of these relationships was a key step towards the invention of the stable model semantics.
The syntax of autoepistemic logic uses a modal operator that allows us to distinguish between what is true and what is known. Michael Gelfond [1987] proposed to read in the body of a rule as " is not known", and to understand a rule with negation as the corresponding formula of autoepistemic logic. The stable model semantics, in its basic form, can be viewed as a reformulation of this idea that avoids explicit references to autoepistemic logic.
In default logic, a default is similar to an inference rule, except that it includes, besides its premises and conclusion, a list of formulas called justifications. A default can be used to derive its conclusion under the assumption that its justifications are consistent with what is currently known. Nicole Bidoit and Christine Froidevaux [1987] proposed to treat negated atoms in the bodies of rules as justifications. For instance, the rule
can be understood as the default that allows us to derive from assuming that is consistent. The stable model semantics uses the same idea, but it does not explicitly refer to default logic.
Stable models
[edit]The definition of a stable model below, reproduced from [Gelfond and Lifschitz, 1988], uses two conventions. First, a truth assignment is identified with the set of atoms that get the value T. For instance, the truth assignment
p q r s T F F T.
is identified with the set . This convention allows us to use the set inclusion relation to compare truth assignments with each other. The smallest of all truth assignments is the one that makes every atom false; the largest truth assignment makes every atom true.
Second, a logic program with variables is viewed as shorthand for the set of all ground instances of its rules, that is, for the result of substituting variable-free terms for variables in the rules of the program in all possible ways. For instance, the logic programming definition of even numbers
is understood as the result of replacing X in this program by the ground terms
in all possible ways. The result is the infinite ground program
Definition
[edit]Let P be a set of rules of the form
where are ground atoms. If P does not contain negation ( in every rule of the program) then, by definition, the only stable model of P is its model that is minimal relative to set inclusion.[1] (Any program without negation has exactly one minimal model.) To extend this definition to the case of programs with negation, we need the auxiliary concept of the reduct, defined as follows.
For any set I of ground atoms, the reduct of P relative to I is the set of rules without negation obtained from P by first dropping every rule such that at least one of the atoms in its body
belongs to I, and then dropping the parts from the bodies of all remaining rules.
We say that I is a stable model of P if I is the stable model of the reduct of P relative to I. (Since the reduct does not contain negation, its stable model has been already defined.) As the term "stable model" suggests, every stable model of P is a model of P.
Example
[edit]To illustrate these definitions, let us check that is a stable model of the program
The reduct of this program relative to is
(Indeed, since , the reduct is obtained from the program by dropping the part ) The stable model of the reduct is . (Indeed, this set of atoms satisfies every rule of the reduct, and it has no proper subsets with the same property.) Thus after computing the stable model of the reduct we arrived at the same set that we started with. Consequently, that set is a stable model.
Checking in the same way the other 15 sets consisting of the atoms shows that this program has no other stable models. For instance, the reduct of the program relative to is
The stable model of the reduct is , which is different from the set that we started with.
Programs without a unique stable model
[edit]A program with negation may have many stable models or no stable models. For instance, the program
has two stable models , . The one-rule program
has no stable models.
If we think of the stable model semantics as a description of the behavior of Prolog in the presence of negation then programs without a unique stable model can be judged unsatisfactory: they do not provide an unambiguous specification for Prolog-style query answering. For instance, the two programs above are not reasonable as Prolog programs—SLDNF resolution does not terminate on them.
But the use of stable models in answer set programming provides a different perspective on such programs. In that programming paradigm, a given search problem is represented by a logic program so that the stable models of the program correspond to solutions. Then programs with many stable models correspond to problems with many solutions, and programs without stable models correspond to unsolvable problems. For instance, the eight queens puzzle has 92 solutions; to solve it using answer set programming, we encode it by a logic program with 92 stable models. From this point of view, logic programs with exactly one stable model are rather special in answer set programming, like polynomials with exactly one root in algebra.
Properties of the stable model semantics
[edit]In this section, as in the definition of a stable model above, by a logic program we mean a set of rules of the form
where are ground atoms.
- Head atoms
- If an atom A belongs to a stable model of a logic program P then A is the head of one of the rules of P.
- Minimality
- Any stable model of a logic program P is minimal among the models of P relative to set inclusion.
- The antichain property
- If I and J are stable models of the same logic program then I is not a proper subset of J. In other words, the set of stable models of a program is an antichain.
- NP-completeness
- Testing whether a finite ground logic program has a stable model is NP-complete.
Relation to other theories of negation as failure
[edit]Program completion
[edit]Any stable model of a finite ground program is not only a model of the program itself, but also a model of its completion [Marek and Subrahmanian, 1989]. The converse, however, is not true. For instance, the completion of the one-rule program
is the tautology . The model of this tautology is a stable model of , but its other model is not. François Fages [1994] found a syntactic condition on logic programs that eliminates such counterexamples and guarantees the stability of every model of the program's completion. The programs that satisfy his condition are called tight.
Fangzhen Lin and Yuting Zhao [2004] showed how to make the completion of a nontight program stronger so that all its nonstable models will be eliminated. The additional formulas that they add to the completion are called loop formulas.
Well-founded semantics
[edit]The well-founded model of a logic program partitions all ground atoms into three sets: true, false and unknown. If an atom is true in the well-founded model of then it belongs to every stable model of . The converse, generally, does not hold. For instance, the program
has two stable models, and . Even though belongs to both of them, its value in the well-founded model is unknown.
Furthermore, if an atom is false in the well-founded model of a program then it does not belong to any of its stable models. Thus the well-founded model of a logic program provides a lower bound on the intersection of its stable models and an upper bound on their union.
Strong negation
[edit]Representing incomplete information
[edit]From the perspective of knowledge representation, a set of ground atoms can be thought of as a description of a complete state of knowledge: the atoms that belong to the set are known to be true, and the atoms that do not belong to the set are known to be false. A possibly incomplete state of knowledge can be described using a consistent but possibly incomplete set of literals; if an atom does not belong to the set and its negation does not belong to the set either then it is not known whether is true or false.
In the context of logic programming, this idea leads to the need to distinguish between two kinds of negation—negation as failure, discussed above, and strong negation, which is denoted here by .[2] The following example, illustrating the difference between the two kinds of negation, belongs to John McCarthy. A school bus may cross railway tracks under the condition that there is no approaching train. If we do not necessarily know whether a train is approaching then the rule using negation as failure
is not an adequate representation of this idea: it says that it's okay to cross in the absence of information about an approaching train. The weaker rule, that uses strong negation in the body, is preferable:
It says that it's okay to cross if we know that no train is approaching.
Coherent stable models
[edit]To incorporate strong negation in the theory of stable models, Gelfond and Lifschitz [1991] allowed each of the expressions , , in a rule
to be either an atom or an atom prefixed with the strong negation symbol. Instead of stable models, this generalization uses answer sets, which may include both atoms and atoms prefixed with strong negation.
An alternative approach [Ferraris and Lifschitz, 2005] treats strong negation as a part of an atom, and it does not require any changes in the definition of a stable model. In this theory of strong negation, we distinguish between atoms of two kinds, positive and negative, and assume that each negative atom is an expression of the form , where is a positive atom. A set of atoms is called coherent if it does not contain "complementary" pairs of atoms . Coherent stable models of a program are identical to its consistent answer sets in the sense of [Gelfond and Lifschitz, 1991].
For instance, the program
has two stable models, and . The first model is coherent; the second is not, because it contains both the atom and the atom .
Closed world assumption
[edit]According to [Gelfond and Lifschitz, 1991], the closed world assumption for a predicate can be expressed by the rule
(the relation does not hold for a tuple if there is no evidence that it does). For instance, the stable model of the program
consists of 2 positive atoms
and 14 negative atoms
i.e., the strong negations of all other positive ground atoms formed from .
A logic program with strong negation can include the closed world assumption rules for some of its predicates and leave the other predicates in the realm of the open world assumption.
Programs with constraints
[edit]The stable model semantics has been generalized to many kinds of logic programs other than collections of "traditional" rules discussed above—rules of the form
where are atoms. One simple extension allows programs to contain constraints—rules with the empty head:
Recall that a traditional rule can be viewed as alternative notation for a propositional formula if we identify the comma with conjunction , the symbol with negation , and agree to treat as the implication written backwards. To extend this convention to constraints, we identify a constraint with the negation of the formula corresponding to its body:
We can now extend the definition of a stable model to programs with constraints. As in the case of traditional programs, to define stable models, we begin with programs that do not contain negation. Such a program may be inconsistent; then we say that it has no stable models. If such a program is consistent then has a unique minimal model, and that model is considered the only stable model of .
Next, stable models of arbitrary programs with constraints are defined using reducts, formed in the same way as in the case of traditional programs (see the definition of a stable model above). A set of atoms is a stable model of a program with constraints if the reduct of relative to has a stable model, and that stable model equals .
The properties of the stable model semantics stated above for traditional programs hold in the presence of constraints as well.
Constraints play an important role in answer set programming because adding a constraint to a logic program affects the collection of stable models of in a very simple way: it eliminates the stable models that violate the constraint. In other words, for any program with constraints and any constraint , the stable models of can be characterized as the stable models of that satisfy .
Disjunctive programs
[edit]In a disjunctive rule, the head may be the disjunction of several atoms:
(the semicolon is viewed as alternative notation for disjunction ). Traditional rules correspond to , and constraints to . To extend the stable model semantics to disjunctive programs [Gelfond and Lifschitz, 1991], we first define that in the absence of negation ( in each rule) the stable models of a program are its minimal models. The definition of the reduct for disjunctive programs remains the same as before. A set of atoms is a stable model of if is a stable model of the reduct of relative to .
For example, the set is a stable model of the disjunctive program
because it is one of two minimal models of the reduct
The program above has one more stable model, .
As in the case of traditional programs, each element of any stable model of a disjunctive program is a head atom of , in the sense that it occurs in the head of one of the rules of . As in the traditional case, the stable models of a disjunctive program are minimal and form an antichain. Testing whether a finite disjunctive program has a stable model is -complete [Eiter and Gottlob, 1993].
Stable models of a set of propositional formulas
[edit]Rules, and even disjunctive rules, have a rather special syntactic form, in comparison with arbitrary propositional formulas. Each disjunctive rule is essentially an implication such that its antecedent (the body of the rule) is a conjunction of literals, and its consequent (head) is a disjunction of atoms. David Pearce [1997] and Paolo Ferraris [2005] showed how to extend the definition of a stable model to sets of arbitrary propositional formulas. This generalization has applications to answer set programming.
Pearce's formulation looks very different from the original definition of a stable model. Instead of reducts, it refers to equilibrium logic—a system of nonmonotonic logic based on Kripke models. Ferraris's formulation, on the other hand, is based on reducts, although the process of constructing the reduct that it uses differs from the one described above. The two approaches to defining stable models for sets of propositional formulas are equivalent to each other.
General definition of a stable model
[edit]According to [Ferraris, 2005], the reduct of a propositional formula relative to a set of atoms is the formula obtained from by replacing each maximal subformula that is not satisfied by with the logical constant (false). The reduct of a set of propositional formulas relative to consists of the reducts of all formulas from relative to . As in the case of disjunctive programs, we say that a set of atoms is a stable model of if is minimal (with respect to set inclusion) among the models of the reduct of relative to .
For instance, the reduct of the set
relative to is
Since is a model of the reduct, and the proper subsets of that set are not models of the reduct, is a stable model of the given set of formulas.
We have seen that is also a stable model of the same formula, written in logic programming notation, in the sense of the original definition. This is an instance of a general fact: in application to a set of (formulas corresponding to) traditional rules, the definition of a stable model according to Ferraris is equivalent to the original definition. The same is true, more generally, for programs with constraints and for disjunctive programs.
Properties of the general stable model semantics
[edit]The theorem asserting that all elements of any stable model of a program are head atoms of can be extended to sets of propositional formulas, if we define head atoms as follows. An atom is a head atom of a set of propositional formulas if at least one occurrence of in a formula from is neither in the scope of a negation nor in the antecedent of an implication. (We assume here that equivalence is treated as an abbreviation, not a primitive connective.)
The minimality and the antichain property of stable models of a traditional program do not hold in the general case. For instance, (the singleton set consisting of) the formula
has two stable models, and . The latter is not minimal, and it is a proper superset of the former.
Testing whether a finite set of propositional formulas has a stable model is -complete, as in the case of disjunctive programs.
See also
[edit]Notes
[edit]- ^ This approach to the semantics of logic programs without negation is due to Maarten van Emden and Robert Kowalski—van Emden & Kowalski 1976.
- ^ Gelfond & Lifschitz 1991 call the second negation classical and denote it by .
References
[edit]- Bidoit, N.; Froidevaux, C. (1987). "Minimalism subsumes default logic and circumscription". Proceedings: Symposium on Logic in Computer Science, Ithaca, New York, June 22-25, 1987. IEEE Computer Society Press. pp. 89–97. ISBN 978-0-8186-0793-6. 87CH2464-6.
- Eiter, T.; Gottlob, G. (1993). "Complexity results for disjunctive logic programming and application to nonmonotonic logics". ILPS '93: Proceedings of the 1993 international symposium on Logic programming. MIT Press. pp. 266–278. ISBN 978-0-262-63152-5.
- van Emden, M.; Kowalski, R. (1976). "The semantics of predicate logic as a programming language" (PDF). Journal of the ACM. 23 (4): 733–742. CiteSeerX 10.1.1.64.9246. doi:10.1145/321978.321991. S2CID 11048276.
- Fages, F. (1994). "Consistency of Clark's completion and existence of stable models". Journal of Methods of Logic in Computer Science. 1: 51–60. CiteSeerX 10.1.1.48.2157.
- Ferraris, P. (2005). "Answer sets for propositional theories". Logic Programming and Nonmonotonic Reasoning. LPNMR 2005. Lecture Notes in Computer Science. Vol. 3662. Springer. pp. 119–131. CiteSeerX 10.1.1.129.5332. doi:10.1007/11546207_10. ISBN 978-3-540-31827-9.
- Ferraris, P.; Lifschitz, V. (2005). "Mathematical foundations of answer set programming". We Will Show Them! Essays in Honour of Dov Gabbay. King's College Publications. pp. 615–664. CiteSeerX 10.1.1.79.7622.
- Gelfond, M. (1987). "On stratified autoepistemic theories" (PDF). AAAI'87: Proceedings of the sixth National conference on Artificial intelligence. pp. 207–211. ISBN 978-0-934613-42-2.
- Gelfond, M.; Lifschitz, V. (1988). "The stable model semantics for logic programming". Proceedings of the Fifth International Conference on Logic Programming (ICLP). MIT Press. pp. 1070–80. ISBN 978-0-262-61054-4.
- Gelfond, M.; Lifschitz, V. (1991). "Classical negation in logic programs and disjunctive databases". New Generation Computing. 9 (3–4): 365–385. CiteSeerX 10.1.1.49.9332. doi:10.1007/BF03037169. S2CID 13036056.
- Hanks, S.; McDermott, D. (1987). "Nonmonotonic logic and temporal projection". Artificial Intelligence. 33 (3): 379–412. doi:10.1016/0004-3702(87)90043-9.
- Lin, F.; Zhao, Y. (2004). "ASSAT: Computing answer sets of a logic program by SAT solvers" (PDF). Artificial Intelligence. 157 (1–2): 115–137. doi:10.1016/j.artint.2004.04.004. S2CID 514581.
- Marek, V.; Subrahmanian, V.S. (1989). "The relationship between logic program semantics and non-monotonic reasoning". Logic Programming: Proceedings of the Sixth International Conference. MIT Press. pp. 600–617. ISBN 978-0-262-62065-9.
- Pearce, D. (1997). "A new logical characterization of stable models and answer sets" (PDF). Non-Monotonic Extensions of Logic Programming. Lecture Notes in Artificial Intelligence. Vol. 1216. pp. 57–70. doi:10.1007/BFb0023801. ISBN 978-3-540-68702-3.
- Reiter, R. (1980). "A logic for default reasoning" (PDF). Artificial Intelligence. 13 (1–2): 81–132. doi:10.1016/0004-3702(80)90014-4.